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Abstract 

Karlin's (1982) Theorem 5.2 shows that linear systems alternating between 
growth and mixing phases have lower asymptotic growth with greater mixing. 
Here this result is extended to linear differential equations that combine site-specific 
growth or decay rates, and mixing between sites, showing that the spectral abscissa 
of a matrix D + raA decreases with m, where D 7^ cl is a real diagonal matrix, 
A is an irreducible matrix with non-negative off-diagonal elements (an ML- or es- 
sentially non-negative matrix), and m > 0. The result is based on the inequality: 
u T Av < r (A), where u and v are the left and right Perron vectors of the matrix 
D + A, and r(A) is the spectral abscissa and Perron root of A. The result gives 
an analytic solution to prior work that relied on two-site or numerical simulation of 
models of growth and mixing, such as source and sink ecological models, or mul- 
tiple tissue compartment models of microbe growth. The result has applications to 
the Lyapunov stability of perturbations in nonlinear systems. 

1 Introduction 

Growth and movement are ubiquitous phenomena in physical, biological, and social 
systems. In particular, site-specific growth and decay rates, combined with movement 
between sites, can be found throughout nature. An even wider range of phenomena 
may be included if we consider the formal equivalent: stare-specific growth rates, and 
tra nsformations between states. 

Karlin (1982) developed two very general theorems on the asymptotic growth rates 



of systems combining growth and movement. The context that motivated these devel- 
opments was rather narrow: analysis of the protection of genetic diversity in a sub- 
divided population undergoing natural selection and migration. But the theorems are 
fundamental, describing the long-term growth behavior of a wide range of coupled 
linear systems, and the stability of many nonlinear systems. 

Karlin's theorems apply to discrete time and discrete space systems in which a 
growth phase is followed by a movement phase, and these are iterated. This paper 
focuses on one of the theorems, Theorem 5.2, and extends it to apply to continuous 
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time and discrete space systems. Extensions to continuous time and continuous space 
systems entail greater technicalities and are deferred to elsewhere. 

Karlin's Theorem 5.2 states simply that in a system of objects undergoing exponen- 
tial growth or decay at different rates in different sites, the greater the level of mixing 
of objects between sites, the lower the long-term growth rate of the whole aggregation 
of objects. Here is the theorem, where p(M) is the spectral radius of matrix M, the 
largest magnitude of any eigenvalue of M: 

Theorem 5.2 ( Karlinl 1982 ). Let M be a general nonnegative irreducible backward 
migration matrix. Consider the family of migration matrices, 

M(a) = (1- a)I + aM. 



Then for any set of positive fitness values D = diag d\, di, 



p(M(a)D) 

is decreasing as a increases (strictly, provided D ^ dl). 

While the result is cast in terms of the specific context, it should be understood that 
M can be any irreducible stochastic matrix, and D any positive diagonal matrix. 

Karlin used the theorem to analyze the stability of coupled nonlinear systems, 
where each site i has a continuous, differentiable map fi(x) : [0, 1] M> [0, 1], so the 
coupled system is: 

x i (t+l) = Y^M ij f j (x j (t)) 
j 

In the population genetics context, fi(0) = 0, and the linearized stability dynamics for 
small x are 

x(t) = (M(a) D)'x(0), (1) 
d/i(0) 



where the diagonal elements of D are D, = 



Ax 



The zero solution x(t) = is 



unstable to perturbation if p(MD) > 1 and stable to perturbation if p(MD) < 1. 

The generality of this result is already evident in that no assumptions are made on 
M beyond that it be stochastic, and irreducible, which means there is a path of non-zero 
elements Mm x , Mk t k 2 > ■ • - > ^k c j between any i and any j ^ i. And no assumptions 
are placed on the set of nonlinear fy(x) other than that they be differentiable, and their 
domain and range be the unit interval, and /;(0) = 0. 

The first use of this th eorem outs i de its original context was to an a lyze the evolution 
of ge netic transmission d Altenberg , Il984t lAltenberg and Feldman, 1987; Altenberg, 
20091) . There, instead of objects moving between sites, the objects are genomes trans- 
forming between genotypes. Theorem 5.2 translates to the reduction principle: the zero 
solution for a gene that controls 'mixing' between genotypes is unstable to perturbation 
(i.e. to introduction of the gene to the population) if the gene reduces mixing. 

The use of this theorem in additional contexts, has to my knowledge, not yet oc- 
curred. This may be due, perhaps, to the limitation of the assumptions of discrete time 
and discrete space, or simply due to the small community of theoretical population 
geneticists familiar with it. 
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Objects: 


Sites: 


Site specific growth/decay rates: 


genes 


habitats 


fitnesses 


genomes 


genotypes 


fitnesses 


organisms 


habitats 


species survival and reproduction rates 


agricultural pests 


fields under treatment 


replication and survival 


microbes 


tissue compartments 


survival and replication rates 


metabolites 


tissue compartments 


catabolic rates 


reactants 


reaction-diffusion 
medium 


reaction rates 


wastes 


reactors 


waste breakdown rates 


photons 


media 


absorption rates 


particles 


heterogenous matter 


interaction and decay rates 


capital 


investments 


rates of return 



Table 1 : A short list of systems exhibiting site-specific growth or decay and mixing 
between sites. 



One may ask whether there is anything about discrete time and space that is essen- 
tial to the result, or whether there is a more general phenomenon that may extend to 
continuous time and space. Here I show that the result can be extended to continuous 
time and discrete space, namely, to coupled linear differential equations. The result 
here applies to any combination of constant exponential growth or decay rates at dif- 
ferent sites, and any constant pattern of movement between sites. The extension of 
Karlin's theory to linear first order differential equations brings a much wider domain 
of systems into its purview. One can contemplate a variety of systems that contain the 
applicable ingredients, shown in Table Q] 

2 The System 

The system investigated here is of the form 

^ = (D + mA)x(t), (2) 

which has solution 

x(()=e (D+mA)f x(0), (3) 

where 

x(£) £ R™ is an n-long vector of the quantities in each site at time t, 
D is a real n x n diagonal matrix of the growth rates at each site, 
m > is the global rate of mixing between sites, and 

A is a real n x n matrix that represents the movement distribution among n sites. A 
is an essentially non-negative matrix (also called a Metzler, Metzler-Leontief, or 
ML-matrix), defined by Mij > for i ^ j. 
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The matrices D and A hold the specifics for a particular system. The diagonal ma- 
trix D contains the growth rates, and under the action of D alone, the time trajectories 
([3]) would be 

Xi(jk) = e Dit Xi(0). 

Exponential growth or decay is determined by whether Di > or Di < 0. 

The matrix A represents movement between sites (or transformations between 
states). The form of variation in movement examined here is of the form: 

F(m) = D + to A, (4) 

where A represents the distribution of movement, while to represents the rate of move- 
ment. The question I address here is how the global level of 'mixing', to, affects 
the asymptotic growth rates of (01, and in particular, the stability of the zero solution 
x(t) = to perturbations. 

The form (0]i includes, as a special case, the form considered bv lKarlinl (119821) . in 
which a period of growth is followed by a period of movement: 

F(to) = [(1 - m)I + toP] D = D + to(PD - D), (5) 

where 

I is the identity matrix, and 
P is a stochastic matrix. 

In continuous time, both growth and movement occur simultaneously, so the analog to 
©is: 

F(to) =D + to(P-I). (6) 

A typical assumption about movement is that quantity is redistributed but con- 
served, in which case summing the effect of movement over all destinations produces 
zero net change: 

e T A = e T (PD — D) = 0, and e T (P I) = 0, (7) 

where 

e is the vectors of ones, and 

t is the transpose of the vector or matrix. 

This class includes the generator matrices of continuous time Markov chains (also 
called 'intensity' matrices). 

If quantity is lost during movement, (O is replaced by 

e T A = e T (PD - D) <^ 0, and e T (P - I) <^ 0. (8) 
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3 Results 

The general phenomenon to be shown is the following: 

Main Result The asymptotic growth rate of 

x(()=e (D+mA)t x(0), 

decreases with increasing values of the mixing parameter, m. If this system exhibits 
net growth, then greater mixing inhibits the rate of growth. If the system exhibits net 
decay, then greater mixing enhances the rate of decay. 

The asymptotic rate of growth or decay in (0) is given by the spectral abscissa 



ine asymptotic rate or growtn or decay in yj) is given by tne spectral abscissa 
of F(m ), which is the large st real part of any eigenvalue of F(m) dBernstein 20091 



p. 734. iGantmacherll 1959a, pp. 125-129). For real matrices that have non-negative 



off-diagonal elements (ML-matrices), the spectral abscissa is always an eigenvalue, 
referred to at the Perron root. Irreducible ML-matrices retain many of the properties 
of irreducible non-negative matrices, including the existence of positive left and right 
eigenvectors (the Perron vectors), unique up to scaling, that are associated with the 
spectral abscissa (the Perron root) ( Senetal 198 ll pp. 46—47). 



This paper considers how variation in the global mixing rate m varies the Perron 
root of F(m). 

Definitions. Let: 

be the vector with element i equal to 1, and other elements equal to zero; 

r (M) := maxi ReA^ (M) refer the spectral abscissa of a square matrix M; 

v(A) > refer to the right Perron vector of an irreducible ML-matrix A, normalized 
so that e T v(A) = 1; 

u( A) T > refer to the left Perron vector of an irreducible ML-matrix A, normalized 
so that u( A) T v(A) — 1 ( the different normalization is convenient later); So, 

u(A) T A = r(A) u(A) T , 
Av(A) = r(A) v(A), and 
u(A) T A v(A) = r(A). 



3.1 Basic Results 

The main result that will be used in the analysis is the following: 

Theorem: 1. Let A be an n x n irreducible real matrix with non-negative off-diagonal 
elements (an ML-matrix), and D be an n x n diagonal real matrix. Then: 

r(A + D) — r(A) < u(A + D) T D v(A + D) (9) 

with equality if and only ifD — c Ifor some cel. 
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Proof. The spectral abscissa of an i rreducible ML-m atrix, A, is its Perron root, which 
is given by this v ariational formula dFriedlandl 1981 , Corollary 3.1, related to the varia- 
ti onal formula of Donsker and Varadhanl (11975b . and shown to extend to ML-matrices 
Altenbergl2009l Lemma 31: 



in 



r(A) = sup inf > p. 
pe-p„ x>0 ^ 



[Ax], 



(10) 



where V n = {p: Pl > 0, £™ =1 = 1} c R" . 

Let x(A) and p(A) be the vectors, as functions of A, for which the sup and inf 
are attained, where x(A) is also normalized so t hat y]H_i a;,; (A) = 1. Then p(A) and 
x(A) are unique critic al points for a given A dFriedland and Karlinlll975t iFriedlandl 
198lt and lKarlin|[l98l p. 195), 



x(A)=v(A), 



(11) 



and 



p(A) = u(A) ov(A), 



(12) 



where o is element-wise the Sc hur-Hadamar d product. 

As utilized in the proof in iKarlin dl982l Theorem 5.2), since x(A) is a unique 
critical point in HOi , the inf means that any y ^ x(A) produces: 



r(A) ~ sup inf > 



[Ax] ; 



< 



[Ay], 



(13) 



Repeating the analogous step in the proof in IKarlin (1982, Theorem 5.2), let y 
v(A). Then 

|> (a + d) k™1 

(A)X>(A + D)^ + X>A + D,M 



i=l 



Vi(A 

n 

r(A) + 5^ ft (A + D)A 

2 = 1 

r(A) + u(A + D) T D v(A + D) 



Hence 



r(A) + u(A + D) T Dv(A + D) > r(A + D), 
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with equality if and only if v(A) = v(A + D), which entails 

(A + D)v(A) = r(A + D)v(A) = Av(A) + Dv(A) 
= r(A)v(A)+Dv(A), 

hence [r(A + D) — r(A)]Iv(A) = Dv(A); and since v(A) > 0, this implies D = 
c I, where c = r(A + D) - r(A). □ 

From Theorem[T]a number of ancillary results can be obtained: 

Corollary: 2 (Convexity Derived). Let A be an irreducible ML-matrix, and D a real 
diagonal matrix. Then, for (3 > 1: 

r(A + D) - r(A) < r(±A + d) - r(lA\ < u(A + (3~D) T D v(A + /3D), 

(14) 

with equality if and only ifD — c I for some c 6 R. 



Proof. Cohen (1979) established the convexity relation: 



(1 - a) r(A) + ar(A + D) > r((l - a) A + a(A + D)) (15) 



for non-negative A and D, and < a < 1. lFriedlandldl98lh showed that when A is 



irreducible, equality holds if and only if D = c I for some cel. This holds when A 
and D are ML-matrices by the relation r(A + cl) — c = r(A), since large enough c 
will guarantee A + cl > 0. 
Rearrangement of dT3T > gives: 



-(A + D) - r(A) > 
1 - ( 

I A + D)-r(iA)] 



- [r((l - a)A + a(A + D)) - r(A)l 



Application of (O gives 

u(A + D) T Dv(A + D) > r(A + D) - r(A) > r(^A + D) - r(^A). 

The condition for equality in both places is that D = c I for some cel. 
Multiplying by a, letting D' := aD, and (3 := 1/a: 

u(A + D) T aD v(A + D) > ar{A + D) - ar(A) > r(A + aD) - r(A) 

u(A + ^D') T D' v(A + /3D') > r^A + D') - tUa} > r(A + D') - r(A). 
Replacement of D' by D gives (Tl4"l >. □ 
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Corollary: 3. For irreducible ML-matrix A and real diagonal matrix D: 

u(A + D) T A v(A + D) < r(A), (16) 

with equality if and only ifG = c Ifor some cel. 

Proof. From TheoremQ] 

u(A + D) T D v(A + D) 

>r(A + D) - r(A) 

= u(A + D) T (A + D) v(A + D) - r(A) 
= u(A + D) T A v(A + D) 

+ u(A + D) T D v(A + D) - r(A), 

and rearranging, 

r(A) > u(A + D) T A v(A + D). 

The equality condition is unchanged from Theorem[T] □ 

Corollary: 4 (Sums). Lef D be a real diagonal matrix, and let A = X^fcLi Afe, w/zere 
{Afc} are ML-matrices that share a common right [left] Perron vector. Then 

N 

u(A + D) T Av(A + D)<^r(A fc ), (17) 

fc=i 

with equality if and only ifTJ = c Ifor some c 6 R. 
Proof. Letting y be the right common Perron vector, then 

N N 

Ay = r(A)y = ^ A 4 y = y ^ r(A 4 ), 

fc=i fc=i 

thus r(A) = Ylk=i r (-^-0' anc l substitution in (fT~6T > yields (fTTT i. For y equal to a 
common left Perron vector, A T is used. □ 

Corollary: 5 (The 'Flip' Theorem dBapat and Raghavani 1 19971 Theorem 3.2.5)). Let 

A be an irreducible ML-matrix. Let y o z = u(A) o v(A), with y > 0, z > 0. Then 

z T A y > u(A) T A v(A) = r(A). 

with equality if and only ify = v(A) or u(A) = v(A). 

In particular, when the left and right Perron vectors are 'flipped', 

v(A) T A u(A) > u(A) T A v(A) = r(A), 

if u(A) ^ v(A). 
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Proof. This is an alternative proof to that given in lBapat and Raghavanl d 1997b . and this 
extends Theorem 3.2.5 to ML-matrices. 

Substituting t/j = u i (A)w i (A)/zj and ( fT2l into ( fT3l , one gets: 

l — l L — l V ' \ // 

71 

= J2 z d A y]i =z T Ay, 

4 = 1 

with equality if and only if y = v(A) or u(A) = v(A). □ 

3.2 Main Result 



These results are now applied to extend Karlin's Theorem 5.2 (Karlin, 1982). Theorem 
5.2 applies to matrices F(m) = D + m A, where D is positive diagonal matrix, 
A = (P — I)D, P is an irreducible stochastic matrix, and < m < 1. Here, results 
are extended to D that may have negative diagonal elements, to arbitrary irreducible 
and reducible ML-matrices, A, and to any m > 0. 

Theorem: 6 (Growth and Mixing). Let D be a real n x n diagonal matrix, and A be 
a real n x n matrix with non-negative off-diagonal elements ( an ML-matrix). Then, for 
m > 0: 

Jr ' D /" A » < r( A), 0*> 
dm 

with equality holding if and only if either: 

1. D = c I/or some c £ K; or, 

2. (a) A is reducible; and 

(b) D K = c I K ,for some c G K,/or every k such that: 

i. k is an index on the diagonal blocks in the Frobenius normal form of 
A, each block being defined by a subset of the indices of A., 

ii. D K andl K are the restrictions o/D and I under the block of indices 
derived from A, and 

Hi. on an open neighborhood of m: 

r(D K + mA K ) = maxh riT)^ + mAj,) = r(D + mA). 

Moreover, for reducible A, with k defined as in \2(b)iii] a sharper inequality obtains: 

dr(D + mA) . . . 

dm * ^ * ^ 

with equality on the left side only under condition \2b\ 
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Proof. Case: Irreducible A. When F(m) is a C 2 function of m, and is an irreducible 
ML-matrix on so me open set around m, then the derivative of its spectral abscissa is 
dAltenberg[l2009l Lemma 4): 

r dF(m) , 

dr(F( '"»-^ P ,(,,.)'~ x( " l)1 ' 



dm Xi(m) 



where for clarity, p(m) : = p(F(m)) and x(m) := x(F(m)). Using ( fTTb and $1% one 
obtains the classical form (ICaswellLEoOOi Sec. 9.1.1): 



dr(F(m)) , T dF(m) 

= u(m) — v(m). 



dm 



Hence, 



Applying Corollary [3] 



dr(D + m A)) 
dm 



dm 



u(m) T A v(m). 



u(m) T (m A) v(m) < r(mA) 
u(m) T A v(m) < r(A), 



and thus 



dr(D + mA) 
dm 



u(m) T Av(m) < r(A), 



with equality if and only if D = c I for some cel. 

Case: Reducible A. If A is reducible, its Frobenius normal form is utilized. The 
Frobenius no rmal form, A, of a r educible matrix A permutes the indices until it has 
the structure (Gantmacher, I1959M p. 75): 



Ai 

















A 2 


















A t 








A t+i,i 


A-t+1,2 


■■■ M+i,t 


A t +i 










A-t+s,2 










A t+S 



(19) 



where the diagonal blocks A^ are irreducible square matrices. The eigenvalues of A 
are the eigenvalues of the irreducible diagonal block matrices A h dAltenbergl Eooi 
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Lemma 5). Therefore, the spectral abscissa for P(m) = D + m A is the maximum of 
the spectral abscissae: 

r(F(m)) = max r(Ff l (m)). 

h 

Hence, for every k that satisfies r(F K (m)) = maxfj r(F^(m)) on some neighborhood 
of m, the result on irreducible matrices yields: 

r(F(m)) = r(F K (m)) < r(A„), (20) 
dm dm 

with equality holding if and only if D K = c I K for some ceK for every k. 

Since r(A) = max;,, r(A/j) > r(A K ), (f20b is seen to be a sharper inequality than 

CDD: 

— r(P(m)) - — r(F„(m)) < r(A„) < r(A). □ 
dm dm 

Remark: Blocks Ai, . . . , A f in $1% are called isolated blocks. If x(0) is zero on 
any isolated block, it remains zero on that block for all t > 0. Should r(A) derive 
from isolated blocks, then the system will asymptotically grow at rate r(A) only if 
x(0) has non-zero elements for one of those blocks. Therefore, the asymptotic growth 
rate for reducible A may depend on the initial value x(0), whereas for irreducible A it 
is independent of any initial value x(0) >^ 0. 



3.3 Conservative and Lossy Mixing 

Inherent to the concepts of movement and mixing is the idea that the redistributed quan- 
tity is conserved or perhaps lost by the movement process, but never created. Theorem 
[6] is more general and does not assume this. But when the assumption that movement 
is conservative or lossy is made, it yields the following results as special cases of The- 
orem[6l 

Corollary: 7 (Measure-Preserving Mixing). If ML-matrix A is such that mixing pre- 
serves the total measure of quantity, then 

dr(D + mA) ^ 
dm 

with equality under the same conditions as Theorem® 

Proof. Measure-preserving mixing means e T A = 0, in which case r(A) = 0, and 
this is applied to dT8V □ 

Corollary: 8 (Lossy Mixing). If ML-matrix A is such that quantity is lost in mixing, 
then 

dr(D + mA) 

—4- '- <0, 

dm 



for any real diagonal matrix D. 
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Proof . Lossy mixing means e T A <=/= 0, which implies by subinvariance ( Senetal 



198ll Corollary 3, p. 52) that r(A) < 0, and this is applied to dT8]l. □ 
Lemma: 9 (Bounds). Let r(A) = 0. Then: 

r(D + mA) e [min(A),max(A)] 



dr(A) 



Proof. We know from lCohenl (119781) that — — — > for any ML-matrix A. So here, 

uAij 

r(D + mA) > 0. Hence 

r(maxi Dil + mA) = max Di + m r( A) = max Di 

i i 

> r(D + mA) > r(rnirij Dil + mA) 

= min Di . □ 

i 

Theorem: 10 (Limit). For any choice of \ 6 [max^ (Di), min^-Dj)], there exists a 
family of ML-matrices, J- C {A: r(A) = 0}, that yields 

lim r(D + m A) = A /or Ae J. 

m— >-oo 

(9ne such family is: 

n 

T = {ae T — I: 2_. a iDi = A, a, > 0, and e J a. = 1}. 

i=i 

Proof. First, let cc > 0, which makes matrices ae T and F(m) := D + m ae T 
irreducible, giving F(m) a unique positive right Perron vector, v(m). Hence 

r(F(m)) v(m) = Dv(m) + ma e T v(m) — m v(m) 

= Dv(m) + m(oc — v(m)) (21) 

m«j «i 

Ul(m) %(F(m))- A + m = r(F(m)) - D~ ~ 

m ' 1 

By Lemma|9l r(F(m)) is bounded, hence 

lim Vi(m) = lim 



m— >-oo m— foo 

m 



r(F(m))-D | x 

When the elements in (BTt are summed: 

?'(F(m)) e T v(m) = r(F(m)) = e T Dv(m) + me 1 (a — v(m)) 

= e T Dv(m) + m(l — 1) 
= e T Dv(m). 
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Hence, 



lim r(F(m)) = lim e T Dv(m) = e T Da 



2 ,Aaj. 

i=l 



The assumption that a > allows some solution to A<*i = A f° r anv 

A G (maxj(Dj), minj(Z)j)). But for A = A max or A = Di min , where i max and i m j n 
are the indices producing maxj(A) and min.j(A), tnen a = e Wx or a — e imi D > 
respectively. In these cases, D + m ae T is no longer irreducible. The Frobenius 
normal form for F(to) = D + m(eie T — I) is 



F(m) = 



Di — m 











D 2 - rn 



Dj — m 



m 



D, 



Hence, the eigenvalues of D + rn(e,e T — I) are the diagonal elements {Dj — m: j ^ 
i} U {Di}. Thus when m > maxj(_D 3 — Dj) then r(F(m)) = Di > Dj — m for all 
3+i. So, 



Letting i 



lim r(D + m(eie T — I)) = Di 

m— J-oo 

and i = i m j n completes the construction. 



□ 



Corollary: 11 (Stability). Let F(m) = D + m A ai in Theorem® Suppose that 
r(A) < 0. If the solution x(t) = is unstable under (O for some m*, then it is 
unstable for all < m < m*. 

Proof. Instability means that r(F(m)) > 0. By Theorem|6] when r(A) < 0, 

dr(F(m)) 



dm 



< r(A) < 0. 



Hence, if r(F(m*)) > 0, then r(F(m)) > r(F(m*)) > for all < m < m*. □ 



3.4 Additional Results 

Corollary: 12 (Heterogeneity). Under low enough mixing, heterogeneity of growth 
rates always produces greater asymptotic growth than the average of the growth rates: 

1 - 

3m* > : V < m < m* : r(D + mA) > - > A- 

n ^— ' 

Proof When D ^ c I for any c G R, then r(D) = max, (A) > ^ E"=i A- Since 
the eigenvalues are continuous functions of the entries of D + m A, there is some 
neighborhood (0, m*) where r(D + mA) > ~ Ya=i A for to G (0, to*). □ 
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Theorem: 13 (Convexity). Let D be a real diagonal matrix, and A be an irreducible 
ML-matrix. Then r(D + to A) is convex in m. Specifically: 
For any mi , to 2 > 0, mi ^ to 2 and < a < 1, one has: 

r((l - a) (D + mi A) + a (D + m 2 A)) 

< (1 - a) r(D + miA) + a r(D + m 2 A), (22) 
with equality if and only ifT) = c I, /or some c G R. 

Pr oof. Convexity of r(D + to A) with respect to diagonal matrix D was established 



:ity 

by ICohenl (1 19791 Theorem 3). Specifically, for real diagonal matrices D, and ML- 



matrix A, for < P < 1: 



r((l-£)Di+£D 2 + A)) 

< (1 - P) r(Di + A) + j8 r(D 2 + A). (23) 



Friedland (1981, Theorem 4.1) showed further that equality in (1231 obtains if and only 
if Di — D 2 = c I for some cel. 

Convexity with respect to the diagonal matrix implies convexity with respect to m, 
which can be seen by morphing d22l into (1231 through the identity 

r(D + m A) = m r(^D + A) : 

r(D + [(1 — a)mi + 07713] A)) 

= f(l — o)toi + OTOol r \ rn r^— ; tD + A 

< (1 - a) r(D + mi A) + a r(D + to 2 A) 
= (l-o)mi r(^D + A) 

+ cito 2 rM^D + A). 
Dividing both sides by (1 — a)mi + am 2 gives: 
r(m t^-t rD + A] 

\ [(1 — a)m 1 +am 2 \ ) 

(1 — o)mi / 1 ^ 

< 7 — ^ — — r[ — D + A 

(1 — o)mi + aTO 2 \ mi 

ami 



(1 — o)toi + OTO 2 



fe D + A )- (24) 



Now, define 



and 



Di := — D, D 2 := — D, 

TOi TO 2 



OTO 2 (1 - o)TOi 

-, SO I — p — 



(1 — o)mi + aTO 2 ' (1 — o)mi + oto 2 
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Thus (l24l > becomes: 

r((l-0)Di+0D 2 + A) 

< (1-/3) r(D!+A) + /3 r(D 2 + A), (25) 

which is (123V The equality condition, Di — D 2 = c I for some c G R, becomes 
Di — D 2 = (1/mi — 1/to2)D = c I. Since mi ^ mi, this is precluded if D 7^ c I 
for any c e R, in which case r(F(m)) is strictly convex in m. □ 



4 Discussion 

The proximate motivation for extending Karlin's Theorem 5.2 from m aps to differen- 



tial equations was a theoretical study by Stei nmever and Wi lke (2009) on the effect of 



tissue compartments for antiviral, lethal mutagenesis therapy. The background of the 
problem is that a number of anti-viral agents seem to work by mutating the virus to 
inviability. But different tissues can concentrate the therapeutic mutagen to different 
concentrations. The greater the concentration, the lower that the viral replication rate 
beco mes. Thus the virus will hav e different growth rates in different compartments. 
Steinmever and Wilkd d2009h ask how heterogeneity in mutagen concentrations, 



and movement of virions between compartments, affects the dosage needed to cure the 
infection. They obtain analytic results for a two compartment model, but for more than 
two compartments must resort to high m and low m limits, course grained approxima- 
tions, and numerical studies, to obtain results. 

Their results exhibit a number of salient features. Here it is noted where each of 
these features is proven analytically: 

1 . Compartmental heterogeneity of growth rates increases long-term viral growth 
rates above the average of the compartment growth rates (Corollary IT2b; 

2. Greater viral mixing between compartments decreases the long-term total viral 
growth rate (Theorem[6]i; 

3. The decrease in viral growth rate is convex in the amount of mixing (Theorem 

H2J; 

4. The whole body viral growth rate for all levels of mixing is bounded above by 
the maximum growth rate in any compartment (Lemma|9]i; 

5. As the amount mixing of virions between compartments increases, it converges 
to a weighted average of the compartmental growth rates (TheoremfTOt. 

As can be seen, each of these qualitative fea t ures o bserved in the two-compartment and 
numerical results of ISteinmever and Wilk e (2009) are in fact properties that extend to 



any number of compartments, arbitrary growth rates among compartments, patterns of 
mixing between compartments, and magnitudes of mixing. 
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4.1 Wider Application 

An intimated in TableQ] the results here may find application in a diverse array of sys- 
tems. The model for v iral replication in multiple tissue compartments in 
Steinmever and Wilke (2009) has the same form as ecological mo dels of sink and 



source populations ( Holj[l996t Armsworth and R oughgarden, 2005). 



In addition to linear dynamical systems of the form (0, many nonlinear systems 
may have (0 as the dynamic of small perturbations. It would be informative to col- 
lect other examples of models of the form (fJJ from the literature in the various fields 
mentioned in Table [T] 

4.2 Further Extensions 

One of the important sources of ML-matrices is the numerical solution of second-order 
partial differential equations. The second derivative can be discretely approximated by 
the cen tered differenc e method, which in one dimension generates a tri-diagonal ML- 



matrix (Beattie, 2007, p. 38-4), with Is along the super- and sub-diagonals, and —2 



along the diagonal. Thus, a centered difference approximation to 

dx(s,t) , . . . d 2 x(s,t) ,„^ s 
— = g(s) x(s, t) + m (26) 

with the proper conditions should be of the form (ffji, and the results here would ap- 
ply. The second derivative in higher dimensions , such as the Laplace oper ator, also has 
discre te approximations that are ML-matrices Jn3 |2007[ pp. 40-1 - 40-2; iGreenbauml 



20071 pp. 41-1 - 41-2), so one can seek analogous results in higher dimensional diffu- 



sions. 

Under the analytic assumptions for which the solutions of the centered difference 
approximation converge to positive eigenfunction solutions of d2"6i i, one can expect that 
the Perron root will be a decreasing function of m. 

Here I have only touched upon some applications and extensions one might find 
for the results presented. The fact that Karlin's result — that mixing reduces growth 
— extends from discrete-time/discrete space systems to continuous-time/discrete space 
systems, and promises to extend further to continuous time and space systems, and to 
diffusion operators, suggests something fundamental in the phenomenon. 
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